Contextual Values of Observables in Quantum Measurements 
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We introduce contextual values as a generalization of the eigenvalues of an observable that takes 
into account both the system observable and a general measurement procedure. This technique leads 
to a natural definition of a general conditioned average that converges uniquely to the quantum weak 
value in the minimal disturbance limit. As such, we address the controversy in the literature regard- 
ing the theoretical consistency of the quantum weak value by providing a more general theoretical 
framework and giving several examples of how that framework relates to existing experimental and 
theoretical results. 
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The weak value (WV) of a quantum operator A was 
introduced in 1988 by Aharonov, Albert, and Vaidman 
(AAV) They claimed that if a system is preselected 
in an initial quantum state \ipi), and post-selected on a 
final state \ipf), then the result of weakly measuring the 
operator A was not its expectation value, but rather its 
weak value: 
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In the WV literature, it is customary to point out that 
for a bounded operator the WV can wildly exceed the 
eigenvalue range, and, unlike the expectation value, can 
be complex. 

The WV has been both a theoretically and experimen- 
tally fruitful concept because of these surprising features. 
Theoretically, WVs have helped to understand counter- 
intuitive results that involve postselection, even resolv- 
ing difficulties that arise in quantum mechanics such as 
Hardy's paradox and apparent superluminal travel Q. 
They can be further seen as a single-system test of quan- 
tum mechanics: whenever the WV exceeds the eigenvalue 
range, it rules out certain classes of hidden variable the- 
ories and is equivalent to violating a generalized Leggett- 
Garg inequality Q . On the experimental side, WVs that 
exceed the eigenvalue range have been observed in opti- 
cal systems 0, [B| . Indeed, WVs have recently found a 
practical application, because an anomalously large WV 
may be used to amplify a small shift in a system param- 
eter. This idea has been exploited in polarization [f| and 
interferometry Q experiments to achieve subpicometer 
position resolution. 

In spite of this list of increasingly impressive accom- 
plishments and insights, there is a long history of con- 
troversy regarding WVs @. Our primary concern here 
is not the ample philosophical discussion, but rather the 
physics controversy which we now detail, (i) As defined, 
the WV (03 is generally complex, so how can it be mea- 
sured? This was artfully put by Landauer when he asked, 
"Has anyone seen a stopwatch with complex numbers on 
its dial?" Q . (Both the real and imaginary parts of the 
WV expression arc physically relevant, but are measured 



with different experiments, see e.g. (ii) Another 

objection is that the AAV formula ((T|) is not unique, and 
that other results for the WV can arise with a slightly dif- 
ferent measurement setup [ll| . (iii) The AAV formula ([T]) 
only applies to pure states and arbitrarily weak measure- 
ments, so is there a generalization to the cases of mixed 
states and finite strength measurements? (iv) Perhaps 
most worrisome, if we view the WV as a weighted average 
of the eigenvalues of the operator, and the WV exceeds 
the eigenvalue range, then we can immediately conclude 
that there must be negative probabilities weighting the 
eigenvalues. This is dramatically illustrated in the three- 
box problem [§[. Taken together, these objections are a 
formidable challenge to the theoretical consistency of the 
concept of the WV. 

The purpose of this Letter is to take the thesis of the 
second paragraph together with the antithesis of the third 
and to develop a synthesis. This will be accomplished by 
going beyond thinking about an observable in terms of 
its eigenvalues and advancing the concept that the mea- 
surement results must be interpreted within their own 
context. This idea will give rise to the notion of the 
contextual values (CV) of an observable. The synthesis 
occurs by showing how the WV can be subsumed as a 
special case in the CV formalism. This formalism con- 
tains additional information that is able to clarify the 
objections listed above in an organic fashion. In addi- 
tion, we will derive a generalized WV formula that can 
be practically applied to a much larger class of problems 
that involve postselected averages. We also provide illus- 
trative examples, and demonstrate that we can recover 
a number of predictions for postselected averages in the 
literature that had been calculated for specific situations 
using other, more involved techniques. 

Contextual values. — The idea behind CV stems from 
the observation that the intrinsically measurable quanti- 
ties in the quantum theory are the outcome probabilities 
for a particular measurement setup. In order to calculate 
averages of an observable from the measured probabili- 
ties, it is empirically necessary to assign values to each 
outcome as a separate step. This secondary step becomes 
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implicit in the operator formalism, so is rarely mentioned 
explicitly. 

Under projective measurements such a value assign- 
ment seems trivial: the outcomes are perfectly distin- 
guishable and easily assigned meaningful values, which 
become the eigenvalues for the observable operator. How- 
ever, when generalizing the measurement to nonprojec- 
tive outcomes, one must be careful to correctly match 
the measured probabilities to a consistent value set. We 
will now show that the observable operator can be ex- 
panded in multiple ways corresponding to the possible 
measurement strategies and value assignments (|4]). 

To illustrate how the observable values change, we con- 
sider a general measurement that is fully characterized by 
a set of N measurement operators, 9Jt = {Mj}, which 
represent the operation of an V-outcome measurement 
apparatus on the system. This measurement context gen- 
erates a corresponding positive operator- valued measure 
(POVM), <£ = {Ej = M]Mj], which, in conjunction 
with an arbitrary density operator p, determines a prob- 
ability measure, Pj = Tr[Ejp], on the space of outcomes. 
Empirically, these probabilities can be measured as the 
relative frequencies of obtaining each outcome. 

Suppose that we wish to reconstruct the average value 
of an observable A using this setup. In order to obtain 
the correct average as a weighted sum, 

(A)=Y / a j P i = ^2a j 1t[E j p], (2) 

3 3 

we must explicitly assign a set of values {ctj} to the N 
outcomes of the apparatus. 

We require that such an average be independent of 9J£, 
and thus a property of the state p itself, which constrains 
the possible value assignments. Specifically, <j2j) should 
be equivalent to the average under a projective context, 

(^l)=^a fc Tr[n fc p]=Tr[ip], (3) 

fc 

where we have identified the projection-valued measure 
(PVM) and the corresponding eigenvalues {a^} as the 
spectral decomposition of a Hermitian operator, A. 

We now assert that the values {ctj} should be deter- 
mined solely by 9J£ so the equality between ([2]) and ([3]) 
should hold for any state p. It follows that the operator 
A can be expanded in multiple ways corresponding to 
possible contexts, 

A = 5>*4- = $>*n fc . ( 4 ) 

3 k 

This operator identity defines the contextual values {ctj} 
of an operator A under a compatible measurement con- 
text 9Xt as a generalization of its eigenvalues that allows 
the full empirical reconstruction of the associated physi- 
cal observable A. 



Indeed we can use this operator equality to obtain the 
n th moment of A using the same experimental setup, 

(A n )= ct n ...a J Ti[E Jl ...E Jn p], (5) 

3l, — ,3n 

provided that the N measurement operators in 271 and A 
all commute. In that case the correlation probabilities, 
/', .... ljn = Tr[M jn ■ ■ ■ M n pM)^ ■ ■ ■ Mjj, appearing in 
([5]) can be measured as the relative frequencies of the 
N n outcome sequences of n repeated measurements. 

In addition to permitting the reconstruction of the mo- 
ments of the observable, its CV are of great interest in 
their own right. Note that the moments of the CV them- 
selves, ^jCt^Pi, only coincide with the moments of A 
for n. = l. Such moments nevertheless contain important 
physics about both the context 971 and the observable A, 
as we consider shortly. 

The construction in general, will fail if N < dimA, 
will be unique UN — dimA, and will be underspecificd if 
N > dimA. The latter case results in an infinite number 
of possible solutions, {ay}. As such, we propose that the 
physically sensible choice of CV is the least redundant set 
uniquely related to the eigenvalues through the Moorc- 
Pcnrose pseudoinverse. 

To illustrate the construction of the least redundant 
set of CV, we consider the case when {Mj} and A all 
commute. It follows that they can all be diagonalized in 
the same PVM, and (j4|) is isomorphic to a matrix equa- 
tion a = Fa = j FkjCtj), with the components of F 

given by, Fkj = Tr[IlkEj]. To find the pseudoinverse of 
the matrix, F, we find its singular value decomposition, 
F = IT£V T , where the orthogonal matrices, U and V, 
are composed of the eigenvectors of FF T and F T F, re- 
spectively, and S is a diagonal matrix composed of the 
singular values of F [l2| . The pseudoinverse of F can be 
constructed as, F + = V"E + U T , where S + is the diago- 
nal matrix constructed from S T by inverting all nonzero 
singular values. It then follows that a uniquely specified 
set of CV can be defined as c?o = F + a. Other solutions 
of (j4|) can be written a = ctQ + x, where x is in the null 
space of F. 

Conditioned averages. — Suppose we now wish to gener- 
alize the notion of an observable average by conditioning 
the average on the result of a second measurement. To do 
so we weight the CV, {ctj }, already determined by the 
first measurement context, Wt^ 1 ', with a set of conditional 
probabilities generated from a second measurement. 

A second measurement is performed on the system 
with a context, 9Jt' 2 ) = {M^}, yielding the full con- 
text for the (ordered) sequential measurements, 9Tt^ 1,2 ^ = 
{Mf ] Mf ] }. The corresponding POVM, (S^ 2 ) = 
{E^ f ' 2) = M^M^M^M^}, gives the measurable 
probabilities, P jf = Ty[E^ 2) p] and P f = £\ P jf . From 
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these probabilities we can define a conditional probabil- 
ity in the usual way, Pj\f = Pjf/Pf, and thus define our 
main result, the conditioned average of an observable, 
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(6) 



Unlike (.4), f(A) is explicitly dependent on SO?' 1 ) and the 
CV. Hence, it must generally be interpreted as encoding 
information not just about the observable A, but also 
about the measurement context itself. 

Conditioned averages must be bounded by the mini- 
mum and maximum CV. However, since the CV range 
is not the same as the eigenvalue range for the observ- 
able, values of / (A) outside the eigenvalue range may be 
obtained. 

Weak values. — To find the weak limit of ^ we note 
that any measurement context continuously connected to 
the identity operation can be decomposed into the form 

"1/2 

9JT = {Uj(g)Ej (g)}, where g is a measurement strength 
parameter, and Uj(g) = exp[igGj] are unitary opera- 
tors generated by Hermitian operators Gj by Stone's 
theorem. Since the POVM elements are bounded and 
partition unity, they can also be expanded as Ej{g) = 
Pj i + gE'j +0(g 2 ), where = 1. 

Writing the initial context in ([5]) to first order in g, we 
find that as g — > 0, the weak limit generally depends ex- 
plicitly on {Gj} and {<xT, and thus will change depend- 
ing on how it is measured and how the CV are chosen 
(see also [H[). However, if Vj, [Gj,p] = 0, so the state is 
minimally disturbed, then the context-dependence van- 
ishes and a generalized WV [l3| is uniquely defined as 
the quantity 



A w =Tr{E ( f ) {A,p}}/2Tr[Ey ) p} 



( 2 ). 



(7) 



where {•, •} denotes the anticommutator. For a pure ini- 
tial state, p = IV'iXV'il) a "P ure POVM," Vj, Uj = i, and 

" (2) 

a strong final measurement, Ej = \^pf)(ipf\ (as consid- 
ered by AAV and most subsequent applications), ([7]) can 
be written as the real part of ([1]) [14J . 

Photon polarization. — As a first example, a photon po- 
larization measurement of tunable strength is detailed in 
Q and can be readily interpreted using CV. The Stokes 
observable S z is measured in the horizontal- vertical po- 
larization basis with the context M+ = 'ytljj + y&v 
and M_ = ytl H + yfl v , leading to the POVM, E± = 
(1/2) (i ± g& z ), where g = 7 2 - 7 2 . The CV for this 
scenario are uniquely ±l/<7, which exceed the eigenvalue 
range for \g\ < 1. 

Computing the conditioned average ^ of S z with an 
initial state of = a\H) + /3\V) and a final state |/) = 
(\H)-\V))/V2 yields 
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f(<r z ) = -\.l 
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<T z (q) P(q|f) 
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= 0.1 



-0.29 



2 )/(l-4 7 7Re[a/3*]), (8) 



FIG. 1: (color online) CV (left) and conditioned CV (right) 
computed using the setup described above Eq. (JTDJ with a 
coupling strength g — 0.1 and a pointer distribution width 
a = 0.3. The preparation state is ^(47^/32), and postselec- 
tion state is f(n/2). Two detector distributions are compared: 
Gaussian (top) with conditioned average (right, dotted line) 
violating the eigenvalue range (dashed lines), and box (bot- 
tom) showing the effect of the shape on the WV convergence 
rate. Strong measurement regime CV for each distribution 
with g — 1.0 are also shown for comparison (left, dotted line). 

which is the result obtained in [B| through direct compu- 
tation using an ancilla system. 

AAV setup. — For the original AAV/von-Neumann 
setup [l[, implemented using photons in an inter- 
action Hamiltonian ffw = gS(t — to)& z ®po of a qubit 
with a detector having momentum operator pu leads to 
a continuous set of measurement operators on the system 
M(q) = (q — g& z \&D), where |<I>_d) is the initial detector 
state, q is the position of the detector pointer, and g is 
the coupling strength. Hence the POVM is simply the 
initial detector probability density shifted in position by 
an operator, E(q) = P D (q - gcr z ) = \(q - ga z \$ D )\ 2 . 

Performing the pseudoinversion [12| leads to the least 
redundant set of CV, cr z (q), for the z-component of the 
pseudospin, 

<?M = [P D {q - 9) -P D (q + g)} I (a - b(g)) , (9) 
a = J P D (q) 2 dq- b[g) = J P D (q - g)P D (q + g) dq. 

From these expressions it is immediately clear that if the 
shift g is small compared to the width of the initial dis- 
tribution Pd(<7), then the POVM will be nearly propor- 
tional to the identity and the measure of overlap, b(g), 
will be nearly equal to the distribution constant a. Hence, 
the measurement will be weak and the CV will diverge 
independently of any specific system states. 

For a Gaussian distribution, Po{q) = 
cxp[-g 2 /2tT 2 ]/crv / 27r, a = 1/2(7 y^r and b(g) = 
aexp[— (g/a) 2 }. Hence, the CV are o- z (q) = 
v / 2exp[- g 2 /2cr 2 ] [sinh(g 3 /CT 2 )/sinh(. g 2 /2a 2 )] Q. With 
an initial state ip( a ) = (cos(a/2), sin(a/2)) in the z basis 
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which matches the result in 0] computed using the quan- 
tum Baycsian approach 15 [. 



-2-10 1 2 3 

Dimensionless measurement result u 

FIG. 2: (color online) QPC CV for a range of measurement 
times and a fixed coupling constant. For small times a z (u) — > 
2\f2u , while for long times a z (u) is only nonzero at ±1. 

conditioned on a final state /(tt/2) = (2)" 1 / 2 (1,1), (JHJ) 
yields 



f(S z ) = cos a / (1 + sincvcxp[— g 2 /2a 2 }). 



(10) 



The strong limit as (g/a) — > oo is cos a and the weak 
limit as (g/a) -> is the WV cot(a/2 + tt/4). The 
same WV is also obtained for any Pd (q) according to (JT]) , 
though the speed of convergence depends on the distri- 
bution shape. Figure Q] illustrates the effect of the shape 
by comparing Gaussian and box distributions. 

Quantum point contact ( QPC ) detector. — A general- 
ization of the AAV result to continuous measurements in 
solid state detectors is given in Q. The measurement 
operators are constructed in terms of an average current 
through a QPC, / = {l/t) J Q I(t')dt', and two charac- 
teristic currents, I\ and li that indicate the position of 
an electron in a nearby double quantum dot. The cur- 
rent detection probabilities are assumed Gaussian with 
a variance of a 2 — Si/2t, where Si is the detector shot 
noise power and t is the averaging time for the current. 

The system can be mapped to the AAV case by making 
the identifications Iq = (I\ + 1%)/2, q = I — Iq, g = (I\ — 
l2)/2, and o 2 = Si/2t. Since the characteristic currents 
for the electron position are fixed, the coupling strength 
g is also fixed. Hence it is useful to consider instead 
the dimensionless variables, u = q/g, and r = (g/a) 2 = 
t/T m , where T m is the characteristic measurement time. 
Figure [2] shows a range of CV for varying t. 

The system is started with an initially arbitrary den- 
sity matrix p = [pij] written in the z basis. After be- 
ing measured for the averaging time t, it is then rotated 
around the or x axis by an angle 9 and measured strongly 
to be in a final state |/) = | + 1) along the new z axis. 
Computing the conditioned average using ([6]) yields 



<(S,) = 



COS 2 (|)pn 



sin 2 (|)p 2 2 



Conclusions. — We have shown how to completely re- 
construct an observable using an arbitrary measurement 
setup by defining contextual values ([J), or generalized 
eigenvalues, that account for the measurement context. 
The approach allows a conditioned average (|6|) to be de- 
fined that correctly reproduces existing results in the 
literature and converges uniquely to a generalized weak 
value ([7|) in the minimum disturbance limit. 

This conditioned average addresses much of the con- 
troversy surrounding weak values in the literature, since 
it (i) is a purely real empirical average, (ii) converges 
to the real part of the standard weak value (Q]) uniquely 
in the minimum disturbance limit with pure initial and 
final states, (iii) applies to any measurement strength, 
arbitrary quantum density operators, and POVM postsc- 
lections, and, (iv) is constructed from well-behaved, mea- 
surable probabilities. 

This work was supported by the NSF Grant No. DMR- 
0844899, ARO Grant No. W911NF-09-1-0417, and a 
DARPA DSO Slow Light grant. 
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